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Abstract — Discovering frequent episodes in event sequences is 
an interesting data mining task. In this paper, we argue tliat this 
frameworli is very effective for analyzing multi-neuronal spilie 
train data. Analyzing spike train data is an important problem 
in neuroscience though there are no data mining approaches 
reported for this. Motivated by this application, we introduce 
different temporal constraints on the occurrences of episodes. 
We present algorithms for discovering frequent episodes under 
temporal constraints. Through simulations, we show that our 
method is very effective for analyzing spike train data for 
unearthing underlying connectivity patterns. 

I. Introduction 

Temporal data mining is concerned with mining of large 
sequential data sets [8]. Frequent episode discovery, originally 
proposed in [1 1], is one of the popular frameworks in temporal 
data mining. Here the data is viewed as a single long sequence 
of events and the task is to unearth temporal patterns (called 
episodes) that occur sufficiently often along that sequence. 
Examples of such data are alarms in a telecommunication 
network, fault logs of a manufacturing plant, multi-neuronal 
spike train recordings, etc. 

In this paper we present new algorithms for discover- 
ing frequent episodes under some temporal constraints. The 
motivation for considering such constraints comes from the 
application that we discuss here, namely, analyzing multi- 
neuron spiking data to infer useful information about the 
underlying microcircuits. Such neuron spike train data can 
be obtained through techniques such as microelectrode array 
experiments. Analyzing simultaneously recorded data from a 
number of neurons is an important and challenging problem. 
The data consists of spike trains from a number of neu- 
rons. Since functionally interconnected neurons tend to fire 
in certain precise patterns, discovering frequent patterns in 
such temporal data can help understand the underlying neural 
circuitry. Here, we argue that the frequent episodes framework 
is ideally suited for such analysis. However, as we shall see, 
for this application we need methods to discover frequent 
episodes where the occurrences of episodes need to satisfy 
some additional temporal constraints. The currently available 
methods for frequent episode discovery can not tackle such 
constraints. In this paper, we present some new algorithms for 
frequent episode discovery under such temporal constraints. 

We explain the problem of analyzing multi-neuronal spike 
train data in Section|II] We then present a brief overview of the 
frequent episodes framework in Section |lll] We introduce the 
notion of temporal constraints on the episode occurrences and 



explain how one can use methods of serial and parallel episode 
discovery under temporal constraints, to discover many pat- 
terns of interest in the spike train data. The algorithms for 
discovering frequent episodes under temporal constraints are 
presented in Section |IV] We present some simulation results 
to illustrate our method of discovering connection patterns 
in neuronal networks in Section |V] Finally, we conclude the 
paper in Section [VT] with a discussion. 

II. Multi-neuronal Spike Train Data 

Over the last couple of decades many new technologies 
have made it possible to simultaneously record signals from 
many neurons and hence to study microcircuits in neuronal 
assemblies. Microelectrode array (MEA) is one such popular 
recording technology. A typical MEA setup consists of 8 x 8 
grid of 64 electrodes with inter-electrode spacing of about 
25 microns and can be mounted on a neural culture or brain 
slice. Other technologies for recording from multiple neurons 
include imaging of neuronal currents using some specialized 
dyes. These technologies now allow for gathering of vast 
amounts of data, especially in neuronal cultures, using which 
one wishes to study connectivity patterns and microcircuits in 
neural systems. (See [13], [6] for some recent studies of this 
kind). 

The availability of vast amounts of such data means that 
developing efficient methods to analyze neuronal spike trains 
is a challenging task of immediate utility in this area. A recent 
review by Brown et.al. summarizes the current state of art [4]. 
Most of the current methods of analysis rely on quantities 
that can be computed through cross correlations among spike 
trains (time shifted with respect to one another) to identify 
interesting patterns in spiking activity [4]. There are also 
methods that look for specific fixed patterns and assess their 
statistical significance under a null hypothesis that different 
spike trains are iid Bernoulli processes [2], [10], [12]. Most 
such methods can not look for patterns that involve more than 
3 or 4 neurons due to the ubiquitous curse of dimensionality. 
Hence model-free techniques such as data mining can be very 
useful in unearthing interesting patterns in the spike trains. 

The patterns that one is interested in this application can 
be roughly grouped into what are called Synchrony, Order 
and Synfire chains. Synchronous firing by a group of neurons 
is interesting because it can be an efficient way to transmit 
information [5]. Ordered firing sequences of neurons where 
times between firing of successive neurons are fairly constant 



denote a chain of triggering events and unearthing such 
relations between neurons can thus reveal some microcircuits 
[1]. Such an ordered chain may be among groups of neurons 
rather than single neurons. Such a pattern is called a Synfire 
chain and is believed to be a very important microcircuit [6]. 
In the next section, we explain how all such patterns can 
be discovered under the framework of frequent episodes with 
temporal constraints. 

III. Frequent Episode Discovery 

Frequent episode discovery framework was proposed by 
Mannila et.al. [11] in the context analyzing alarm sequences 
in a communication network. Laxman et.al. [9] introduced the 
notion of non-overlapped occurrences as episode frequency 
and proposed efficient counting algorithms. We first give a 
brief overview of this framework. 

The data to be analyzed is a sequence of events denoted 
by {{El, ti), {E2, ^2), • ■ •) where Ei represents the event type 
and ti the time of occurrence of the i*^ event. Ei's are drawn 
from a finite set of event types. The sequence is ordered with 
respect to times of occurrences so that, ti < i^+i, Vi. The 
following is an example event sequence containing 7 events 
with 5 event types. 

{{A, 1), (B, 3), {D, 4), (C, 6), {A, 12), {E, 14), (B, 15)) (1) 

In multi-neuron data, a spike event has the label of the 
neuron (or the electrode number in case of micro-electrode 
array recordings) which generated the spike as its event type 
and has the associated time of occurrence. The neurons in the 
ensemble under observation fire action potentials (or spikes) 
at different times. AU these spike events are strung together, 
in time order, to give a single long data sequence as needed 
for frequent episode discovery. 

The general temporal patterns that we wish to discover in 
this framework are called episodes. In this paper we shall deal 
with two types of episodes: Serial and Parallel. 

A serial episode is an ordered tuple of event types. For 
example, (A — > i? — > C) is a 3-node serial episode. The 
arrows in this notation indicate the order of the events. Such 
an episode is said to occur in an event sequence if there are 
corresponding events in the prescribed order. In sequence ([T]i, 
the events {A, 1), {B, 3), (C, 6) constitute an occurrence of the 
above episode. In contrast a parallel episode is similar to an 
unordered set of items. We denote a 3-node parallel episode 
with event types A, B and C, as {ABC). An occurrence of 
[ABC) can have the events in any order in the sequence. The 
events (_B, 3), (C, 6), (A, 12) constitute an occurrence of the 
parallel episode {ABC). 

We note here that occurrence of an episode (of either 
type) does not require the associated event types to occur 
consecutively; there can be other intervening events between 
them. In the multi-neuronal data, if neuron A makes neuron B 
to fire, then, we expect to see B following A often. However, 
in different occurrences of such a substring, there may be 
different number of other spikes between A and B because 
many other neurons may also be spiking simultaneously. Thus, 
the episode structure allows us to unearth patterns in the 
presence of such noise in spike data. 



An episode /3 is a sub-episode of episode a if all event 
types of (3 are in a and if the order among the event types 
of (3 is same as that for the corresponding event types in a. 
For example {A ^ B), {A C), and {B C) are 2- 
node sub-episodes of the 3-node episode {A B C), 
while {B A) is not. In case of parallel episodes, there 
is no ordering requirement. Hence every subset of the set of 
event types of an episode is a subepisode. It is to be noted 
here that occurrence of an episode implies occurrence of all 
its subepisodes. 

Frequency of an episode is some measure of how often an 
episode occurs in the data and there are different ways of 
defining it. Here, we use the frequency measure proposed in 
[9] known as non-overlapped occurrence count. A collection 
occurrences of an episode a are said to be non-overlapped if 
no event associated with one appears in between the events 
associated with any other The corresponding frequency for 
episode a is defined as the cardinality of the largest set of non- 
overlapped occurrences of a in the given event sequence. (See 
[9] for more discussion). This definition of frequency results 
in very efficient counting algorithms [9]. In the context of our 
application, counting non-overlapped occurrences is natural 
because we would then be looking at causative chains that 
happen at different times again and again. 

A. Temporal Constraints 

As stated earlier, in this paper we present algorithms for 
discovering frequent episodes where, while counting the fre- 
quency, we include only those occurrences which satisfy some 
additional temporal constraints. We mainly consider two types 
of such constraints: episode expiry time and inter-event time 
constraints. 

Given an episode occurrence (that is, a set of events in 
the data stream that constitute an occurrence of the episode), 
we call the largest time difference between any two events 
constituting the occurrence as the span of the occurrence. For 
serial episodes, this would be the difference between times of 
the first and the last events. The episode expiry time constraint 
requires that we count only those occurrences whose span is 
less than a (user-specified) time Tx. In the algorithm in [11], 
the window width essentially implements an upper bound on 
the span of occurrences. An efficient algorithm for counting 
non-overlapping occurrences of serial episodes that satisfy an 
expiry time constraint is available in [9]. 

The inter-event time constraint, which is meaningful only 
for serial episodes, is specified by giving an interval of the 
form {Tiow, Thigh] and requires that the difference between 
the times of every pair of successive events in any occurrence 
of a serial episode should be in this interval. In a generalized 
form of this constraint, we may have different time intervals 
for different pairs of events. In the next section, we present al- 
gorithms for counting non-overlapped occurrences of episodes 
under time constraints. 

While these temporal constraints are motivated by our 
application, these are fairly general and would be useful in 
many other applications of frequent episode discovery. 



B. Episodes as patterns in neuronal spike data 

The analysis requirements of spike train data are met very 
well by the frequent episodes framework. Serial and parallel 
episodes with appropriate temporal constraints can capture 
many patterns of interest in multi-neuronal data. 




(a) Synchrony (b) Order 




(c) Synfire Chain 



Fig. 1 . Examples of neuronal connection structures that can result in different 
patterns in the spike trains 

As stated earlier, one of the patterns of interest is Syn- 
chrony or co-spiking activity in which groups of neurons fire 
synchronously. This kind of synchrony may not be precise. 
Allowing for some amount of variability, co-spiking activity 
requires that all neurons must fire within a small interval of 
time of each other (in any order) for them to be grouped 
together Such synchronous firing patterns may be generated 
using the structure as shown in Fig. WL^)- Such patterns of 
Synchrony can be discovered by looking for frequent parallel 
episodes which satisfy an expiry time constraint. The expiry 
time here controls the amount of variability allowed for 
declaring a grouped activity as synchronous. 

Another pattern in spike data is ordered firings. A simple 
mechanism that can generate ordered firing sequences is shown 
in Fig. [TJb). Serial episodes capture such a pattern very well. 
Once again, we may need some additional time constraints. 
A useful constraint is that of inter-event time constraint. In 
multi-neuron data, if we want to conclude that A is causing 
B to fire, then B cannot occur too soon after A because there 
would be some propagation delay and B can not occur too 
much later than A because the effect of firing of A would 
not last indefinitely. For example, we can prescribe that inter- 
event times to be of the same order as the synaptic delay times 
so that a frequent serial episode may capture an underlying 
microcircuit. Thus, serial episodes with proper inter-event time 
constraints can capture ordered firing sequences which may be 
due to underlying functional connectivity. 

Another important pattern in spiking data is that of synfire 
chains [6]. This consists of groups of synchronously firing 
neurons strung together with tight temporal constraints. By a 
combination of parallel and serial episode discovery we can 
unearth such patterns also. The structure shown in Fig. [TJc) 
captures such a synfire chain. We can think of this as a 



microcircuit where A primes synchronous firing of {BCD), 
which, through E, causes synchronous firing of (FGHI) and 
so on. When such a pattern occurs often in the spike train data, 
parallel episodes like (BCD) and (FGHI) become frequent 
(by using appropriate expiry time constraint). After discov- 
ering all such parallel episodes, we replace all recognized 
occurrences of each of these episodes by a new event in the 
data stream with a new symbol (representing the episode) for 
the event type and an appropriate time of occurrence. Then 
we discover serial episodes on this new data stream. With this 
procedure, we can unearth patterns such as synfire chains. We 
show later that our algorithms can discover such synfire chains. 

IV. Discovering frequent episodes under 

TEMPORAL CONSTRAINTS 

In this section we describe our algorithms that discover fre- 
quent episodes under expiry and inter-event time constraints. 
The inter-event time constraints are meaningful only for serial 
episodes and that is the case we consider. Since algorithms 
for taking care of expiry time are available in case of serial 
episodes [9], we consider the case of only parallel episodes 
under expiry time constraint. 

A frequent episode is one whose frequency exceeds a user 
specified threshold. The overall objective is to find all frequent 
episodes. Counting of all possible episodes is infeasible in 
most real problems due to combinatorial explosion. As is 
common in such data mining methods, we use a level-wise 
Apriori style [3] procedure. Under this we use frequent iV- 
node episodes to create N-i-l-node candidates, and, using 
another pass over the data, obtain frequent N-i-l-node episodes. 
The basic structure of the frequency counting algorithm is 
similar to the ones in [11], [9] and we also use finite state 
automata for recognizing episode occurrences. 

A. Serial Episode with Inter-event Constraints 

Under an inter-event time constraint, the time between 
successive events in any occurrence have to be in a prescribed 
interval. To take care of this we use a new episodes structure. 
The episode structure now consists of an ordered set of 
intervals besides the set of event types. An interval (^lo^,, 
is associated with i*^ pair of consecutive of event types in the 
episode. For example, a 4-node serial episode is now denoted 
as follows: 

(A -^B -^C ^D) (2) 

In a given occurrence of episode A B C D let Ia, 
tB, tc and to denote the time of occurrence of corresponding 
event types. Then this is a valid occurrence of the serial 
episode with inter-event time constraint given by (|2]i, if tj^^^ 
< (tB-tA) < ti^gh, tl^ < (tc-tB) < tl^gh and t'l^^ < 

{tD - tc) < tlg^. 

In general, an A^-node serial episode is associated with, A^ — 
1 inter-event constraints of the form (ijotu , i/i^g/J ■ Episode dis- 
covery under such constraints involves discovery of frequent 
serial episodes along with discovery of the most appropriate 
inter-event constraint for every pair of nodes. In this subsection 
we present an algorithm for this where the user provides a set 



of non-overlapped intervals to serve as candidates for inter- 
event time constraints. An important special case is one where 
the same interval is to be used for all inter-event constraints 
and our general algorithm can easily be specialized for this 
case. 

1 ) Candidate generation scheme: The candidate generation 
schemes in [11], [9] require that the frequency of an episode 
is less that or equal to that of all its subepisodes. This is not 
true when we have inter-event time constraints. For example, if 
episode (A^^ -3 C) is frequent, the sub-episodes (A^^ 

B) and (5^—3 C) would be frequent, but for the subepisode 

(?,?] 

{A — ► C) the inter-event constraint is not intuitive. Hence, 
we use a different candidate generation scheme here. 

The candidate episodes in this case are generated as follows. 
Let a and (3 be two fc-node frequent episodes such that by 
dropping the first node of a and the last node of /3, we get 
exactly the same [k — l)-node episode. A candidate episode 7 
is generated by copying the fc-event types and (fc — l)-intervals 
of a into 7 and then copying the last event type of /3 into the 
[k + 1)*'' event type of 7 and the last interval of (3 to the fc*'* 
interval of 7. Fig. |2] shows the candidate generation process 
graphically. 




;_ioj5|^ 



Fig. 2. Visualization of Candidate generation for serial episodes with inter- 
event constraints 

2) Counting episodes with generalized inter-event time con- 
straint: We first explain the need for a new algorithm to 
count occurrences of serial episode under inter-event time 
constraints. Consider the event sequence 

{{A, 1), (A 2), {B, 4), {A, 5), (C, 10), {B, 12), (C, 13), {D, 17)). 

(0 5] (5 lot 

Let the serial episode under consideration be {A-^ B—^ — > 

C^— -! D). All the current algorithms for counting occurrences 
of serial episodes either look at left most occurrence of episode 
or inner most occurrence of episode (See [7] for details). In 
the given event sequence, the left most occurrence is {{A, 1), 
{B,A), (C, 10), (D, 17)) and the inner most occurrence is 
((A, 5), (-8,12), (C,13), (£',17)), where as only the occur- 
rence {{A, 2), (B, 4), (C, 13), (D, 17)) satisfies the inter-event 
constraints. 

The counting algorithm is listed as Algorithm 2 in the 
Appendix. It uses waits lists indexed by event types and a 
linked list of node structures for each episode as the basic 
data-structures. The entries in the waits lists are nodes. For 
each episode we have a doubly linked list of node structures 
with a node corresponding to each of the event types and 
arranged in the same order as that of the episode. The node 
structure has a tlist field that stores the times of occurrence of 
the event-type represented by its corresponding riode. Other 
field in the node structure is visited, which is a boolean field 
that indicates whether the event type is seen atleast once. 



On seeing an event type Ei, the algorithm iterates over 
list waits{Ei) and updates each node in the list. We explain 
the procedure for updating the nodes, by considering the the 
example sequence given in ^ and the episode a — (A^— ► 
b'"-^ — ! (7^*^ D). Working of the algorithm in this example 
is illustrated in Fig. |3] 

The waits lists are initialized by adding the nodes cor- 
responding to first event type of each episode in the set of 
candidates to the corresponding waits{.) list. In the example, 
let the node tracking event type A be denoted by nodeA, 
and so on. Initially waits{A) contains nodeA- The boxes in 
Fig.Orepresent an entry in the tlist of a node. An empty box 
is one that is waiting for the first occurrence of an event type. 
On seeing {A,l), it is added to tlist of nodeA, and nodes 
is added to waits{B). At any time, the node structures are 
waiting for all event types that have been already seen and 
the next unseen event type. 

Sometime later, at t = 4, the first occurrence of a i? is 
seen. The tlist of nodeA is traversed to find atleast one 
occurrence of A, such that ts — tA £ (0,5]. Both (A, 1) 
and {A, 2) satisfy this and hence, (-6,4) is accepted into 
the nodes -tlist. The rule for accepting an occurrence of an 
event type (which is not the first event type of the episode) 
is that there must be atleast one occurrence of the previous 
event type (in this example A) which can be paired with 
the occurrence of the current event type (in this example 
B) without violating the inter-event constraint. Note that this 
check is not necessary for the first event of the episode. After 
seeing the first occurrence of B, nodec is added to waits{C). 
Using the above rules the algorithms accepts {A, 5), (C, 10) 
into the corresponding tlists. At t ^ 12, for (_B, 12) none of 
the entries in nodeA-tlist satisfy the inter-event constraint for 
the pair A ^ B. Hence (_B, 12) is not added to the tlist of 
node B - Rest of the steps of the algorithm are illustrated in the 
figure. 

If an occurrence of event type is added to node.tlist, it is 
because there exist events for each event type from the first 
to the event type corresponding to the node, which satisfy 
the respective inter-event time constraints. An occurrence of 
episode is complete when an occurrence of the last event type 
can be added to the tlist of the last node structure tracking 
the episode. 

The tlist entries shown crossed out in the figure are the 
ones that can be deallocated from the memory. In the example, 
at t = 12, when the algorithm tries to insert {B, 12) into 
nodes -tlist, the list of tlist entries for occurrences of A's 
is traversed. [A, 1) with inter-event constraint (0, 5] can no 
longer be paired with a B since the inter-event time duration 
for any incoming event exceeds 5, hence {A,l) can be 
safely removed from the nodeA-tlist. This holds for {A, 2) 
and [A, 5) as well. In this way the algorithm frees memory 
wherever possible without additional processing burden. 

Many times, in addition to counting frequencies, we may 
want to be able to track all the occurrances of episodes 
that were counted. For this, we need to store sufficient back 
references in the data. This adds some memory overhead, but 
tracking may be useful in visualizing the discovered episodes. 
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Fig. 3. Visualization of Algorithm [T] 



the noise level) in the system. An absolute refractory period 
is also used. This is the short time after a spike in which the 
neuron cannot respond to another stimulus. 

A. Network patterns 

In this section, we demonstrate how we can obtain useful 
information about the structure of the underlying network 
using combination of serial and parallel episode discovery. 
Using the simulation model described above, we can embed 
different types of network patterns. Fig. [T] shows examples 
of types of inter connections we make to embed different 
patterns. For this in the simulator we make these required 
connections between neurons have high weights. (In addition 
there are also random interconnections among neurons). We 
discuss three examples in this section. 
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B. Parallel episodes with expiry 

Since parallel episodes do not need any order on the 
events, it is reletively simpler to count their occurrances. We 
specialized the parallel episode discovery algorithm presented 
in [7] to handle expiry time constraint. That is, we count the 
number of non-overlapped occurrences of a set of parallel 
episodes in which all the constituting events occur within time 
Tx of each other Since the modifications needed are simple, 
due to space limitations we do not provide the details here. 

V. Simulation Results 

In this section we present some results obtained using syn- 
thetic data as well as some real neuronal data. We also discuss 
the issue of statistical significance of discovered episodes. 

We used a simulation model to generate data that would 
resemble actual multi-neuronal recordings. Spike train of each 
neuron is modeled as an inhomogeneous poisson process. 
Neurons in the network are randomly interconnected. Each 
connection is assigned a weight. For the random intercon- 
nections, the weight attached to each synapse is set using a 
uniform distribution over [— c, c] where c is chosen to be 
relatively small. When we want to embed any specific pattern, 
then, we set the weights of the required connections between 
neurons to a higher value. 

The spike trains of each neuron is simulated as a rate 
varying poisson process. The spiking rates of neurons are 
updated every AT using the following. 

Amax 



A,(fc) 



1 



(4) 



cxp(-/j(fc) + d) 

where Xj{k) is the firing rate of j*^^ neuron at time fcAT 
and Ij{k) is its total input at that time. Ij{k) = '^Oi{k)wij 
where Oi{k) is the output of i*^ neuron and Wij is the 
weight of synapse from i*'' to j*^ neuron. Oi{k) is taken to 
be the number of spikes by the i*'' neuron in the interval 
( (fc - /i.)AT, {k-h - 1)AT] where h is a. small integer that 
represents the synaptic delay in units of AT. In (|4] Xmax is the 
maximum firing rate and d determines the resting spiking rate 
(i.e. when input is zero). This is the quiescent firing rate (or 



Fig. 4. Network pattern for Example 1 

Example 1: In a 26 neurons network (where each neuron 
corresponds to an alphabet) we embed the pattern shown in 
FigH The simulation is run for 50 sec and approximately 
25,000 spikes are generated. The synaptic delay is set to be 
about 5 milli sec. We have chosen AT — 1 milli sec and have 
taken refractory time also to be the same. 
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Parallel episodes for Example 1 
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.01 


.29 


2(4) 
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.01 
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2(4) 
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D F : 348; F D : 323 


.002-.004 


.01 


.26 


1(26) 


no 2 or more 
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.004-.006 


.01 


.29 


4(4) 


A B C D : 597 
A B E F : 589 
A B E D : 530 
A B C F : 530 



TABLE II 
Serial episodes Example 1 



The sequence is then mined for frequent parallel episodes 
with different expiry times. The results are given in Table U 
The table shows the expiry time used, the frequency threshold, 
time taken by the algorithm on a Intel dual core PC running 



at 1 .6 GHz, the size of the largest frequent episode discovered 
and the number of episodes of this size along with the actual 
episodes. We follow the same structure for all the tables. The 
frequency threshold is expressed as a fraction of the entire data 
length. A threshold of 0.01 over a data length of 25,000 spike 
events requires an episode to occur atleast 250 times before it 
is declared as frequent. From TableUit can be seen that (CE) 
and (DF) turn out to be the only frequent parallel episodes 
if the expiry time is 1 to 2 milli sec. If the expiry time is too 
small, we get no frequent episodes (at this threshold). On the 
other hand, if we increase the expiry time to be 7 milli sec 
which is greater than a synaptic delay, then even [FEDC) 
turns out to be a parallel episode. This shows that by using 
appropriate expiry time, parallel episodes discovered capture 
synchronous firing patterns. 

The results of serial episode discovery are shown in Table 
nil With an inter-event constraint of 4-6 milli sec, we discover 
all paths in the network (Fig. |4]i. When we prescribe that 
inter-event time be less than 2 milli sec (when synaptic 
delay is 5 milli sec), we get nodes in the same level as our 
serial episodes. If we use intervals of 2-4 milli sec, we get 
no episodes because synchronous firings mostly occur much 
closer and firings related by a synapse have a delay of 5 milli 
sec. Thus, using inter-event time constraints, we can get fair 
amount of information of the underlying connection structure. 
It may seem surprising that we also discover A i? — > C 
F and A ^ B ^ E D when we use 4-6 milli sec 
constraint. This is because, the network structure is such that 
D and F fire about one synaptic delay time after the firing of 
C and E. Thus, the serial episodes give the sequential structure 
in the firings which could, of course, be generated by different 
interconnections. The frequent episodes discovered provide a 
handle to unearthing the hierarchy seen in the data (i.e. which 
events co-occur and which ones follow one another). 

Example 2: In this example we consider the network 
connectivity pattern as shown in Fig. [lie). As stated earlier, 
this is an example of a Synfire chain. We use the same 
parameters in the simulator as in Example 1 and generate 
spike trains data using this connectivity pattern. Table HiH 
shows the parallel episodes discovered and Table |IV] shows 
the serial episodes discovered with different inter-event con- 
straints. From the tables, it is easily seen that parallel episodes 
with expiry time of 1 milli sec and serial episodes with inter- 
event time constraint of about one synaptic delay, together 
give good information about underlying network structure. In 
this example, we illustrate how our algorithms can discover 
synfire chain patterns. We first discover all parallel episodes 
with expiry time 1 milli sec. Then for each frequent parallel 
episode, we replace each of its occurrences in the data stream 
by a new event with event type being the name of the parallel 
episode. This new event is put in with a time of occurrence 
which is the mean time in the episode occurrence. We then 
discover all serial episodes with different inter-event time 
constraints. The results obtained with this method are shown 
in Table |V] As can be seen, the only pattern we discover 
is the underlying synfire chain. This example shows that by 
proper combination of parallel and serial episodes, we can 
obtain fairly rich pattern structures which are of interest in 



neuronal spike train analysis. 



Episode 
expiry 


Freq. 
Th. 


Time 

(sec) 


Size 
(No.) 


Patterns 
Discovered 


.001 


.01 


.15 


4(1) 


L K : 307 
C B D : 293 
H G F I : 268 
rest are 
sub-episodes 



TABLE III 
Parallel episodes for Example 2 



Inter-event 
interval 


Freq. 
Th. 


Time 
(sec) 


Size 
(No.) 


Patterns 
Discovered 


.002-.004 


.01 


.157 


1(26) 


no episodes ot 2 
or more nodes 


.004-.006 


.01 


.469 


6(24) 


A D E H J K : 195 
A D E I J K : 194 
A D E H J L : 193 
A C E H J K : 192 


.006-.008 


.01 


.156 


1(26) 


no episodes of 2 
or more nodes 



TABLE IV 
Serial episodes for Example 2 



Inter-event 
interval 


Freq. 
Th. 


Time 
(sec) 


Size 
(No.) 


Patterns 
Discovered 


.002-.004 


.01 


.11 


1(20) 


no episodes ot 
2 or more nodes 


.004-.006 


.01 


.14 


6(1) 


A IC B DJ E 

[H G F I] J [L K] : 137 


.006-.008 


.01 


.12 


1(20) 


no episodes ot 
2 or more nodes 



TABLE V 
Synfire episodes for Example 2 



Example 3: In this example, we choose a network pattern 
where different pairs of interconnected neurons can have 
different synaptic delays and we demonstrate the ability of 
our algorithm to automatically discover appropriate inter-event 
intervals. The pattern is shown in Fig. |5] where we have 
different synaptic delays as indicated on the figure. 




Fig. 5. Network Pattern for Example 3 

The results for parallel episode discovery (see Table IVII ) 
show that (ABC) is the group of neurons that co-spike 
together The serial episode discovery results are given in Table 
IVIII As can be seen from the table, with different pre-specified 
inter-event time constraints we can discover only different 
parts of the underlying network graph because no single inter- 
event constraint captures the full pattern. 

As in Example 2, we replace occurrences of parallel episode 
with a new event in the data stream. We then run Algorithm [T| 



Episode 


Freq. 


Time 


Size 


Patterns 


expiry 


Th. 


(sec) 


(No.) 


Discovered 


.001 


.01 


.28 


3(1) 


A B C : 614 


.002 


.01 


.25 


3(1) 


A B C : 617 


.004 


.01 


.28 


4(1) 


A B C D : 537 


.006 


.01 


.32 


4(2) 


X A B C : 602 
A B C D ; 542 



TABLE VI 
Parallel episodes for Example 3 



Inter-event 
interval 


Freq. 
Th. 


Time 
(sec) 


Size 
(No.) 


Patterns 
Discovered 


.000-.002 


.01 


.32 


2(6) 


A C : 385; B A ; 376 
B C : 373; A B ; 372 
C A ; 361; C B : 355 


.002-.004 


.01 


.37 


2(4) 


E F : 783; A D : 656 
C D : 651; B D : 646 


.004-.006 


.01 


.28 


2(3) 


X A : 790; X B : 774 
X C ; 769 


.006-.008 


.01 


.29 


2(2) 


D E : 720; X D ; 454 



TABLE Vll 
Serial Episodes for Example 3 



Inter-event 
interval 


Freq. 
Th. 


Time 
(sec) 


Size 
(No.) 


Patterns 
Discovered 


{.000-.002, 

.002-.004, 

.004-.006, 

.006-.008, 
.008-.010} 


.01 


1.37 


5(1) 


.^.UU4-.()Uii 

[ABC] 

^.006-. 008 

.002-. 004 
hi ' r 

: 372 



TABLE VIII 
Synfire episodes for Example 3 



to discover serial episodes along with inter-event constraints, 
given a set of possible inter-event intervals. The results ob- 
tained are shown in Table IVIlTl As can be seen from the table, 
the algorithm is very effective in unearthing the underlying 
network pattern. 

B. Significance of discovered patterns 

The examples above show that if we generate spike data 
using special embedded patterns in it then our algorithms can 
detect them. However, this does not answer the question; if 
the algorithm detects some frequent episodes what confidence 
do we have that they correspond to some patterns. Here, we 
try to answer this question by showing that it is unlikely to 
have long frequent episodes if the data generation model does 
not have any specific biases. We generate such random data 
as follows. We use the same simulator but with only random 
interconnection weights and no specially introduced strong 
connections. We generated ten sets of random interconnection 
weights and for each set we generated ten sets of data (25 
000 spikes) by running the simulator with those weights. 
Thus we have 100 data sets in which while neurons still 
fire with input dependent firing rates, there are no special 
causative connections. Apart from this we generated another 
50 data sets where the firing rates of neurons are chosen 
randomly at each AT. We then discover serial episodes of 



size up to 10 with a frequency threshold of zero so that we 
get frequencies for all episodes. Table |IX] shows maximum 
frequency (averaged over the 150 data sets) versus size of 
episodes that we obtained. We have also generated 20 data 
sets in which a long ordered chain is embedded. The table 
also shows the minimum frequency (averaged over the 20 data 
sets) versus size for episodes which are subepisodes of the 
embedded chain. From the table it can be seen that even for 
size 2, the maximum frequency of an episode in the random 
data is very small. From size 3 onwards, all episodes have 
frequency less than 10. On the other hand, when the data 
contains patterns, even the minimum observed frequencies of 
that size episodes are about two orders of magnitude larger 
This provides sufficient statistical justification that it is higly 
unlikely to have long episodes with appreciable frequencies if 
the data source does not have the necessary bias. 



Size 


26 event types 




Noise sequence 


Sequences with patterns 




Avg. Max. Episode 


Avg. Min. Sub-Episode 




Frequency 


Frequency 


1-Node 


1050.57 


967.80 


2-Node 


61.47 


845.65 


3-Node 


8.51 


734.55 


4-Node 


3.31 


647.30 


5-Node 


2.03 


576.06 


6-Node 


1.25 


515.88 


7-Node 


.13 


466.33 


8-Node 


.12 


423.58 


9-Node 


.12 


385.25 


10-Node 


.12 


353.88 




Sample size = 150 


Sample size = 20 



TABLE IX 



Serial Episode frequencies in random and patterned data 



C. Analysis of multi-neuron data 

In this section we describe results obtained on calcium 
imaging data reported in [6]. (We are grateful to Dr. Rafeal 
Yuste for sharing this Calcium Imaging data with us). In 
[6], Ikegaya et. al. analyzed how neural activity propagates 
through cortical networks. They found precise repetitions of 
spontaneous patterns. These patterns repeated after minutes 
maintaining millisecond accuracy. In Fig. 3A of [6], such 
patterns are shown in raster plots by connecting the spikes 
that are part of an occurrence. 

In Fig. |6] we show results obtained on the same calcium 
imaging data set using frequent episode discovery algorithms. 
Fig.|6](a) shows two occurrences of a 10-node parallel episode 
discovered with expiry time constraint Tx = 10 time units. 
Fig.|6](b) shows four occurrences of two 4-node serial episode 
discovered with inter-event constraint of to 10 time units. It is 
seen that the results obtained using frequent episode discovery 
match with those presented in [6]. Also, the time needed by our 
algorithm is much smaller because in [6], they use a counting 
technique that cannot control the combinatorial explosion. This 
result brings out the utility of our data mining technique in 
terms of both effectiveness and efficiency. 
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(b) Frequent serial episodes of size 4 satisfying inter-event constraint = 10 
time units 

Fig. 6. Frequent episodes discovered using our algorithms on real data. 
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VI. Conclusion Appendix 

Frequent episode discovery is a very efficient temporal data 
mining technique. In this paper we have presented some new 
algorithms for frequent episode discovery under expiry time 
and inter-event time constraints. The temporal constraints are 
motivated by the problem of analyzing multi-neuron spiking 
data. We have discussed the kind of patterns that neurobiolo- 
gists look for in such data and have shown that our algorithms 
are very effective in unearthing the underlying connectivity 
structure from spike data. In this context, our temporal con- 
straints are very useful in focusing the search for patterns and 
tackling combinatorial explosion. Also, we can readily relate 
these constraints to relevant biological parameters. 

One of the main motivations for this paper is to introduce 
the problem multi-neuron spike data analysis to data mining 
community. This is a challenging problem of analyzing large 
data sets to find underlying patterns, though no data mining 
techniques have so far been used for this. One can think of 
this problem as one of learning network connectivity pattern 
given only node-level dynamic data. Such a problem would be 
relevant in many other application areas as well. For example, 
analyzing abnormal behavior of communication networks, 
finding hidden causative chains in complex manufacturing 
processes controlled by distributed controllers, etc. We hope 
our results presented here would contribute towards developing 
of data mining techniques relevant in such applications. 



Algorithm 1 Counting serial episodes with inter-event time 

constraints 

Input: Set C of A^-node episodes, event streams {{Ei,ti)), 

frequency threshold A,„i„ G [0, 1] 
Output: The set F of frequent episodes in C 



1: for all event types A do 

2: Initialize waits{A) = 

3: for all a G C do 

4: Set prev ~ (j) 

5: for i 1 to do 

6: Create node with node.visited = false; 
node. episode ~ a; node.index = i; 
node.prev ~ prev; node.next = (j) 

1: if i = 1 then 

8: Add node to waits{a[l]) 

9: if prev ^ then 

10: prev.next = node 

11: for i = 1 to 71 do 

12: for all node G waits{Ei) do 

13: Set accepted ~ false; a ~ node. episode; j — 

node.index; tlist = node.tlist 

14: if j < TV then 

15: for all tval G tlist do 

16: if [ti — tval.init) > a.thighlj] then 

17: Remove tval from tlist 

18: if J = 1 then 

19: Update accepted = true; tval.init = ti 

20: Add tval to tlist 

21: if node.visited = false then 

22: Update node.visited = true 

23: Add node.next to waits{a[j + 1]) 

24: else 

25: for all prev-tval G node.prev .tlist do 

26: ittt-prevJval G (a.tiowij -^,a.thigh[j -i]] 
then 

27: Update accepted = true; tval.init = ti 

28: Add tval to t^isi 

29: if node.visited = false then 

30: Update node.visited ~ true 

31: if node.index < N — 1 then 

32: Add node.next to 'u;aits(a[j + 1]) 

33: else 

34: if ti — prevJval > a.thigh[j — 1] then 

35: Remove prevdval from node. prev. tlist 

36: if accepted = true and node.index ~ N then 

37: Update a.freq = a.freq + 1 

38: Set temp = node 

39: while temp ^ do 

40: Update temp.visited = false 

41: if temp.index ^ 1 then 

42: Remove temp from ?i;azfs(Q;[ieTOp.mc?ea:]) 

43: Update temp = temp.next 

44: Output F = {a G C such that a.freq > nXmin} 



